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Abstract 



Brandenburg and (implicitly) Dejean introduced the concept of repetition threshold: the 
smallest real number a such that there exists an infinite word over a fc-letter alphabet that 
avoids /3-powers for all P > a. We generalize this concept to include the lengths of the avoided 
words. We give some conjectures supported by numerical evidence and prove one of these 
conjectures. 

1 Introduction 

In this paper we consider some variations on well-known theorems about avoiding repetitions in 
words. 

A square is a repetition of the form xx, where x is a nonempty word; an example in English 
is hot shots. It is easy to see that every word of length > 4 over an alphabet of two letters must 
contain a square, so squares cannot be avoided in infinite binary words. However, Thue showed 
[14L 1151 0] that there exist infinite words over a three-letter alphabet that avoid squares. 

Instead of avoiding all squares, one interesting variation is to avoid all sufficiently large squares. 
Entringer, Jackson, and Schatz [7] showed that there exist infinite binary words avoiding all squares 
XX with \x\ > 3. Furthermore, they proved that every binary word of length > 18 contains a factor 
of the form xx with |x| > 2, so the bound 3 is best possible. For some other papers about avoiding 
sufficiently large squares, see |Hl ITTl 151 IT^ IT^ . 

Another interesting variation is to consider avoiding fractional powers. For a > 1 a rational 
number, we say that y is an a-power if we can write y = x^x' with x' a prefix of x and \y\ = a\x\. 
For example, the word alfalfa is a 7/3-power and the word tormentor is a |-power. For real 
a > 1, we say a word avoids a-powers if it contains no factor that is a a'-power for any rational 
a' > a. Brandenburg |3] and (implicitly) Dejean [5] considered the problem of determining the 
repetition threshold; that is, the least exponent a = a(k) such that there exist infinite words over an 
alphabet of size k that avoid (a + e)-powers for all e > 0. Dejean proved that a{3) = 7/4. She also 
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conjectured that a(4) = 7/5 and a{k) = k/{k — 1) for k > 5. In its fuh generahty, this conjecture 
is still open, although Pansiot ^01 proved that a (4) = 7/5 and Moulin-Ollagnier [Hj proved that 
Dejean's conjecture holds for 5 < < 11. For more information, see 

In this paper we consider combining these two variations. We generalize the repetition threshold 
of Dejean to handle avoidance of all sufficiently large fractional powers. (Pansiot also suggested 
looking at this generalization at the end of his paper |lf)j . but to the best of our knowledge no one 
else has pursued this question.) We give a large number of conjectures, supported by numerical 
evidence, about generalized repetition threshold, and prove one of them. 

2 Definitions 

Let a > 1 be a rational number, and let £ > 1 be an integer. A word w is a repetition of order a 
and length I if we can write it as id = x^x' where x' is a prefix of \x\ = I, and = a\x\. For 
brevity, we also call w a (a, i) -repetition. Notice that an a-power is an (a, ^)-repetition for some i. 
We say a word is (a, i)-free if it contains no factor that is a (a', £')-repetition for a' > a and i' > £. 
We say a word is {a~^,i)-free if it is (a',£)-free for all a' > a. 

For integers k > 2 and ^ > 1, we define the generalized repetition threshold R{k,i) as the real 
number a such that either 

(a) over a A;- letter alphabet there exists an (a'^,^)-free infinite word, but all (a,£)-free words are 
finite; or 

(b) over a /c-letter alphabet there exists a (a, £)-free infinite word, but for all e > 0, all (a — e, £)- 
free words are finite. 

Notice that R{k, 1) is essentially the repetition threshold of Dejean and Brandenburg. 

Theorem 1 The generalized repetition threshold R{k, tj exists and is finite for all integers k > 2 
and £>1. Furthermore, l + £/k^ < R{k,£) < 2. 

Proof. Define S to be the set of all real numbers a > 1 such that there exists a {a,£)-fiee 
infinite word over a /c-letter alphabet. Since Thue proved that there exists an infinite word over a 
two-letter alphabet (and hence over larger alphabets) avoiding (2 + e)-powers for all e > 0, we have 
that P = inf S exists and (3 < 2. If /? G 5, we are in case (b) above, and if /3 ^ 5, we are in case 
(a). Thus R{k,£) = p. 

For the lower bound, note that any word of length >k^ + £ contains >k^ + l factors of length 
£. Since there are only k^ distinct factors of length £, such a word contains at least two occurrences 
of some word of length £, and hence is not (1 + ^)-free. ■ 

Remarks. 

1. It may be worth noting that we know no instance where case (b) of the definition of gener- 
alized repetition threshold above actually occurs, but we have not been able to rule it out. 

2. Using the Lovasz local lemma, Beck has proved a related result: namely, for all e > 0, 
there exists an integer n' and an infinite (1 + n/{2 — e)*^, n)-free binary word for all n > n' . Thus 
our work can be viewed as a first attempt at an explicit version of Beck's result (although in our 
case the exponent does not vary with n). 
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3 Conjectures 



In this section we give some conjectures about R{k 
Conjecture 2 

R{k,2) -- 



{3k — 2) /{3k — 4), if k is even; 
{3k — 3) /{3k — 5), if k is odd. 



Conjecture 3 R{3, £) = 1 + j for i > 2. 

These conjectures are weakly supported by some numerical evidence. The following table gives 
the established and conjectured values of R{k,i). Entries in bold have been proved; other entries, 
in light gray, are merely conjectured. If the entry for {k, t) is a, then we have proved, using 
the usual tree-traversal technique discussed below, that there is no infinite (a, £)-free word over a 
/c-letter alphabet. 
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Figure 1: Known and conjectured values of R{k,l). 



The proved results are as follows: 

• R{2, 2) = 1 follows from Thue's proof of the existence of overlap-free words over a two-letter 
alphabet [HIiaE]; 

• R{2, 2) = 2 follows from Thue's proof together with the observation of Entringer, Jackson 
and Schatz [7]; 
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• i?(3, 1) = 7/4 is due to Dejean 

• i?(4, 1) = 7/5 is due to Pansiot HHI; 

• R{k, 1) = k/{k - 1) for 5 < A; < 11 is due to Moulin-Ollagnier 0; 

• R{3, 2) = 3/2 is new and is proved in Sectional 

We now explain how the conjectured results were obtained. We used the usual tree-traversal 
technique, as follows: suppose we want to determine if there are only finitely many words over the 
alphabet S that avoid a certain set of words S. We construct a certain tree T and traverse it using 
breadth- first search. The tree T is defined as follows: the root is labeled e (the empty word). If a 
node w has a factor contained in S, then it is a leaf. Otherwise, it has children labeled wa for all 
a € S. It is easy to see that T is finite if and only if there are finitely many words avoiding S. 

We can take advantage of various symmetries in S. For example, if S is closed under renaming 
of the letters (as is the case in the examples we study), we can label the root with an arbitrary 
single letter (instead of e) and deduce the number of leaves in the full tree by multiplying by 

If the tree is finite, then certain parameters about the tree give useful information about the 
set of finite words avoiding S: 

• If /i is the height of the tree, then any word of length > h over S contains a factor in S. 

• If M is the length of longest word avoiding S, then M = h — 1. 

• If / is the number of internal nodes, then there are exactly / finite words avoiding S. Fur- 
thermore, if L is the number of leaves, then (as usual), L = 1 + (|S| — 1)1. 

• If /' is the number of internal nodes at depth h — 1, then there are I' words of maximum 
length avoiding S. 

Table 2 gives the value of some of these parameters. Here a is the established or conjectured 
value of R{k,£) from Table 1. 

We have seen how to prove computationally that only finitely many (a,^)-free words exist. 
But what is the evidence that suggests we have determined the smallest possible a? For this, we 
explore the tree corresponding to avoiding (a"*", ^)-repetitions using depth-first (and not breadth- 
first) search. If we are able to construct a "very long" word avoiding (a"*", £)-repetitions, then we 
suspect we have found the optimal value of a. For each unproven a given in Table 1, we were able 
to construct a word of length at least 500 (and in some cases, 1000) avoiding the corresponding 
repetitions. This constitutes weak evidence of the correctness of our conjectures, but it is evidently 
not conclusive. 

To show what can go wrong, the data we presented evidently suggests the conjecture R{2, 1) = 
{i + 2)/L But we have proven this is not true, since the tree avoiding (1.2608, 8)-repetitions is 
finite, with height 195 and 53699993 internal nodes. (Perhaps R{2,8) = 29/23.) 
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Figure 2: Tree statistics for various values of k and / 
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4 A new result 



In this section we prove the following new result: 
Theorem 4 R{3,2) = |. 

Proof. From the numerical results reported in Table 2, we know that there exist no infinite 
words over a 3-letter alphabet avoiding (|, 2)-repetitions. It therefore suffices to exhibit an infinite 
word over a 3-letter alphabet that avoids (|^, 2)-repetitions. 

Now consider the uniform morphism h : {0, 1, 2, 3}* — > {0, 1, 2}* defined by 

h{0) = 000211, h{2) = 020011, 

h{l) = 101221, h{3) = 120221. 

By a result of JU], there exist |'''-free infinite words over four letters. Consider one such word x. 
We will prove that /i(x) is (|, 2)"'"-free. 

We notice first the following synchronising property of the morphism h: for any a,b,c € 
{0,1,2,3} and s,r £ {0,1,2}*, if h{ab) = rh{c)s, then either r = e and a = c or s = e and 
b = c. This is straightforward to verify. 

We now argue by contradiction. Assume /i(x) has a repetition xyx such that \x\ > \y\. If 
|x| > 11, then each occurrence of x contains as factor at least one full /i-image of a letter. By 
the above synchronising property, the second x will contain the same full images and at the same 
positions, say x = x'x"x"' with x" = h{u), \x'\ < 5, \x"'\ < 5. Therefore, ^(x) contains the factor 
x' h{u)x"' yx' h{u)x"' and x has the factor uvu, where h{v) = x"'yx' . We next compute the order of 
this repetition in x. Assuming \x\ > 50, we have \x\ > 5|x'x"'| and so 

\uvu\ ^ \x"\ _ \x\ — \x'x"'\ 7 
\ — ^ = 1 + ^ — = 1 + ',',, ' > -, 
\uv\ \x\ + \y\ \x\ + \y\ 5 

a contradiction, since x is ^ -free. For the case when \x\ < 50, it can be shown by exhaustive 
search that the only possibility for such a repetition in h(x) is \xy \ = 1. Thus, /i(x) is (|,2)+-free. 
■ 

Remarks. 

1. We needed much less than \x\ > \y\ in obtaining the contradiction. In fact, |^ > | + (5, for 
some (5 > is sufficient. What we obtain is, for any 5 > there exists k = k{6) such that /i(x) is 
{l + 6,k{5))-bee. 

2. The set of those x, y for which we need to check (|, 2)"'"-freeness can be substantially reduced 
by a more detailed analysis. (For instance, we bounded by 10, the simplest bound, but this 
can be significantly reduced.) We used |x| < 50 in order to simplify the proof. 
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